An Ontology-based Term Weighting Technique for Web Document Categorization
Similar resources
Partitioning-based clustering for Web document categorization
Clustering techniques have been used by many intelligent software agents in order to retrieve, filter, and categorize documents available on the World Wide Web. Clustering is also useful in extracting salient features of related web documents to automatically formulate queries and search for other similar documents on the Web. Traditional clustering algorithms either use a priori knowledge of documen...
Term weighting based on document revision history
In real-world information retrieval systems, the underlying document collection is rarely stable or definite. This work is focused on the study of signals extracted from the content of documents at different points in time for the purpose of weighting individual terms in a document. The basic idea behind our proposals is that terms that have existed for a longer time in a document should have a...
An Ensemble Click Model for Web Document Ranking
Every year, web search engine providers spend more and more money on ranking documents in search engine result pages (SERPs). Click models provide useful information for ranking documents in SERPs by modeling interactions between users and search engines. Here, three modules are employed to create a hybrid click model; the first module is a PGM-based click model, the second module in a d...
Term Weighting in Short Documents for Document Categorization, Keyword Extraction and Query Expansion
This thesis focuses on term weighting in short documents. I propose weighting approaches for assessing the importance of terms for three tasks: (1) document categorization, which aims to classify documents such as tweets into categories, (2) keyword extraction, which aims to identify and extract the most important words of a document, and (3) keyword association modeling, which aims to identify...
An Ontology Based Document Management
In this article, an approach to the problem of associating documents with a knowledge base is demonstrated in a real-world application. It is based on a combination of annotating documents with concepts from a knowledge base and grouping documents together into clusters. Our knowledge base is an ontology provided by a dedicated ontology server. 2 Introduction WWW is slightly becoming the most ...
Journal
Journal title: Procedia Computer Science
Year: 2018
ISSN: 1877-0509
DOI: 10.1016/j.procs.2018.07.010